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Abstract 

In this report, a multiple state estimation method for reinforcement learning model is proposed. 
The steering maneuver of a vehicle is learned from a reward. The reward was evaluated which the. vehicle is 
on the road or not. The reward and control signal were calculated by multiple state using a vehicle dynamics 
model. Prom the simulation result, this model can drive on unknown road configuration or velocity condition. 
This model also explain a gaze control by changing information using a control policy or an environment. 
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reward transition 
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Trajectory lor obstacle avoidance 
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